Effective sketching methods for value function approximation

نویسندگان

  • Yangchen Pan
  • Erfan Sadeqi Azer
  • Martha White
چکیده

High-dimensional representations, such as radial basis function networks or tile coding, are common choices for policy evaluation in reinforcement learning. Learning with such high-dimensional representations, however, can be expensive, particularly for matrix methods, such as least-squares temporal difference learning or quasi-Newton methods that approximate matrix step-sizes. In this work, we explore the utility of sketching for these two classes of algorithms. We highlight issues with sketching the high-dimensional features directly, which can incur significant bias. As a remedy, we demonstrate how to use sketching more sparingly, with only a left-sided sketch, that can still enable significant computational gains and the use of these matrix-based learning algorithms that are less sensitive to parameters. We empirically investigate these algorithms, in four domains with a variety of representations. Our aim is to provide insights into effective use of sketching in practice.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tensor Sketching: Sparsification and Rank-One Projection

In this paper, we investigate effective sketching schemes for high dimensional multilinear arrays or tensors. More specifically, we propose a novel tensor sparsification algorithm that retains a subset of the entries of a tensor in a judicious way, and prove that it can attain a given level of approximation accuracy in terms of tensor spectral norm with a much smaller sample complexity when com...

متن کامل

Sharper Bounds for Regularized Data Fitting

We study matrix sketching methods for regularized variants of linear regression, low rank approximation, and canonical correlation analysis. Our main focus is on sketching techniques which preserve the objective function value for regularized problems, which is an area that has remained largely unexplored. We study regularization both in a fairly broad setting, and in the specific context of th...

متن کامل

An ${cal O}(h^{8})$ optimal B-spline collocation for solving higher order boundary value problems

As we know the approximation solution of seventh order two points boundary value problems based on B-spline of degree eight has only ${cal O}(h^{2})$ accuracy and this approximation is non-optimal. In this work, we obtain an optimal spline collocation method for solving the general nonlinear seventh order two points boundary value problems. The ${cal O}(h^{8})$ convergence analysis, mainly base...

متن کامل

Randomized Block Krylov Methods for Stronger and Faster Approximate Singular Value Decomposition

Since being analyzed by Rokhlin, Szlam, and Tygert [1] and popularized by Halko, Martinsson, and Tropp [2], randomized Simultaneous Power Iteration has become the method of choice for approximate singular value decomposition. It is more accurate than simpler sketching algorithms, yet still converges quickly for any matrix, independently of singular value gaps. After Õ(1/ ) iterations, it gives ...

متن کامل

Lecture 7 : Sampling / Projections for Least - squares Approximation , Cont .

We continue with the disucssion from last time. There is no new reading, just the same as last class. Recall that last time we provided a brief overview of LS problems and a brief overview of sketching methods for LS problems. For the latter, we provided a lemma that showed that under certain conditions the solution of a sketched LS problem was a good approximation to the solution of the origin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1708.01298  شماره 

صفحات  -

تاریخ انتشار 2017